Performance Modeling of HPC Applications
Authors
Abstract
Performance models of applications enable HPC system designers and centers to gain insight into the hardware best suited to their applications, giving them valuable information about which hardware components (for example, processors or network) will, for a given investment of time and money, yield the most benefit for the applications slated to run on a new system. Developing accurate performance models for scientific applications on such complex systems can be difficult. In section 2 we briefly review a framework we developed [1] that provides an automated means of carrying out performance modeling investigations. In section 3 we describe ongoing work to lower the overhead required for obtaining application signatures, and how we increased the level of detail of our convolutions, with resulting improvements in modeling accuracy. In section 4 we show how these technology advances have enabled performance studies that explain why the performance of applications such as POP (Parallel Ocean Program) [2], NLOM (Navy Layered Ocean Model) [3], and Cobalt60 [4] varies on different machines, and that quantify the performance effect of various components of the machines. In section 5 we generalize these results to show how these applications' performance would likely improve if the underlying target machines were improved in various dimensions (as, for example, on future architectures).
Similar resources
A FEM Multiscale Homogenization Procedure using Nanoindentation for High Performance Concrete
This paper aims to develop a numerical multiscale homogenization method for prediction of elasto-viscoplastic properties of a high performance concrete (HPC). The homogenization procedure is separated into two-levels according to the microstructure of the HPC: the mortar or matrix level and the concrete level. The elasto-viscoplastic behavior of individual microstructural phases of the matrix a...
Performance Evaluation and Modeling of HPC I/O on Non-Volatile Memory
HPC applications place high demands on I/O performance and storage capability. Emerging non-volatile memory (NVM) techniques offer low latency, high bandwidth, and persistence for HPC applications. However, the existing I/O stack is designed and optimized on the assumption of disk-based storage. To effectively use NVM, we must reexamine the existing high performance computing (HPC) I/O...
Proceedings of the 2017 Winter Simulation Conference
High-performance Computing (HPC) systems have gone through many changes during the past two decades in their architectural design to satisfy the increasingly large-scale scientific computing demand. Accurate, fast, and scalable performance models and simulation tools are essential for evaluating alternative architecture design decisions for the massive-scale computing systems. This paper recoun...
Applying an Automated Framework to Produce Accurate Blind Performance Predictions of Full-Scale HPC Applications
This work builds on an existing performance modeling framework that has been proven effective on a variety of HPC systems. This paper will illustrate the framework’s power by creating blind predictions for three systems as well as establishing sensitivity studies to advance understanding of observed and anticipated performance of both architecture and application. The predictions are termed bli...
SONAR: Automated Communication Characterization for HPC Applications
Future computing systems will need to operate within hard power and energy constraints; this is particularly true for Exascale-class systems. These constraints are hard for technical, economic, and ecological reasons, so such systems have to operate within given power and energy budgets. Therefore, we anticipate the need for modeling tools that help to predict power and energy consumption. ...
AutoMatch: Automated Matching of Compute Kernels to Heterogeneous HPC Architectures
HPC systems contain a wide variety of heterogeneous computing resources, ranging from general-purpose CPUs to specialized accelerators. Porting sequential applications to such systems for achieving high performance requires significant software and hardware expertise as well as extensive manual analysis of both the target architectures and applications to decide the best performing architecture...